Reinforcement learning

Results: 1147



#Item
661Functional languages / Lisp programming language / Common Lisp / Cross-platform software / Reinforcement learning / Markov decision process / Ordinal number / Function / Lisp / Software engineering / Computer programming / Computing

Concurrent Hierarchical Reinforcement Learning Bhaskara Marthi, David Latham, Stuart Russell Carlos Guestrin Dept of Computer Science

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2004-09-02 13:55:11
662Operations research / Science / Dynamic programming / Markov processes / Stochastic control / Reinforcement learning / Mechanism design / Markov decision process / Vickrey–Clarke–Groves auction / Statistics / Control theory / Game theory

Approximately Efficient Online Mechanism Design David C. Parkes DEAS, Maxwell-Dworkin Harvard University

Add to Reading List

Source URL: www.eecs.harvard.edu

Language: English - Date: 2005-01-05 13:04:15
663Operations research / Cybernetics / Mathematical optimization / Search algorithms / Reinforcement learning / Perceptron / Dynamic programming / Regression analysis / Support vector machine / Machine learning / Statistics / Artificial intelligence

Learning for stochastic dynamic programming Sylvain Gelly and J´er´emie Mary and Olivier Teytaud ∗ IA-TAO, Lri, Bˆ at. 490,

Add to Reading List

Source URL: eprints.pascal-network.org

Language: English - Date: 2006-11-07 05:40:56
664Control theory / Mathematical optimization / Equations / Reinforcement learning / Automated planning and scheduling / Anytime algorithm / Algorithm / Dynamic programming / Shortest path problem / Mathematics / Operations research / Applied mathematics

From: AAAI-93 Proceedings. Copyright © 1993, AAAI (www.aaai.org). All rights reserved. Planning Thomas Wit

Add to Reading List

Source URL: aaai.org

Language: English - Date: 2006-01-09 21:10:32
665Stochastic optimization / Markov models / Reinforcement learning / Variance / Algorithm / Statistics / Machine learning / Multi-armed bandit

multi-bandit_techreport.dvi

Add to Reading List

Source URL: www.princeton.edu

Language: English - Date: 2011-10-26 19:05:07
666Reinforcement learning / Robotics / Mobile robot / Robot / Reinforcement / Action learning / E-learning / Behavior / Learning / Education / Behaviorism

Copyright by Jefferson Provost 2007 The Dissertation Committee for Jefferson Provost

Add to Reading List

Source URL: ftp.cs.utexas.edu

Language: English - Date: 2007-08-17 16:29:06
667SARSA / Q-learning / Reinforcement learning / Temporal difference learning / Machine learning / Algorithm / Apprenticeship learning / Spatial memory / Artificial intelligence / Learning / Mathematics

Learning to Follow Navigational Directions Adam Vogel and Dan Jurafsky Department of Computer Science Stanford University {acvogel,jurafsky}@stanford.edu

Add to Reading List

Source URL: nlp.stanford.edu

Language: English - Date: 2010-05-17 17:59:35
668Stochastic control / SARSA / Markov models / Theoretical computer science / Reinforcement learning / Q-learning / Council on Environmental Quality / Temporal difference learning / Partially observable Markov decision process / Statistics / Markov processes / Dynamic programming

Consistent exploration improves convergence of reinforcement learning on POMDPs Paul A. Crook Gillian Hayes

Add to Reading List

Source URL: homepages.inf.ed.ac.uk

Language: English - Date: 2007-07-04 12:19:49
669Stochastic control / Partially observable Markov decision process / Markov decision process / Markov model / Dynamical system / Reinforcement learning / Pruning / Linear programming / Constraint algorithm / Statistics / Dynamic programming / Markov processes

Planning in Models that Combine Memory with Predictive Representations of State

Add to Reading List

Source URL: aaai.org

Language: English - Date: 2006-01-11 01:12:22
670Stochastic optimization / Markov models / Variance / Reinforcement learning / Statistics / Machine learning / Multi-armed bandit

Multi-Bandit Best Arm Identification Victor Gabillon Mohammad Ghavamzadeh Alessandro Lazaric INRIA Lille - Nord Europe, Team SequeL

Add to Reading List

Source URL: www.princeton.edu

Language: English - Date: 2011-10-26 19:05:10
UPDATE